Application of gene ontology to gene identification.

نویسندگان

  • Hugo P Bastos
  • Bruno Tavares
  • Catia Pesquita
  • Daniel Faria
  • Francisco M Couto
چکیده

Candidate gene identification deals with associating genes to underlying biological phenomena, such as diseases and specific disorders. It has been shown that classes of diseases with similar phenotypes are caused by functionally related genes. Currently, a fair amount of knowledge about the functional characterization can be found across several public databases; however, functional descriptors can be ambiguous, domain specific, and context dependent. In order to cope with these issues, the Gene Ontology (GO) project developed a bio-ontology of broad scope and wide applicability. Thus, the structured and controlled vocabulary of terms provided by the GO project describing the biological roles of gene products can be very helpful in candidate gene identification approaches. The method presented here uses GO annotation data in order to identify the most meaningful functional aspects occurring in a given set of related gene products. The method measures this meaningfulness by calculating an e-value based on the frequency of annotation of each GO term in the set of gene products versus the total frequency of annotation. Then after selecting a GO term related to the underlying biological phenomena being studied, the method uses semantic similarity to rank the given gene products that are annotated to the term. This enables the user to further narrow down the list of gene products and identify those that are more likely of interest.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification and prioritization genes related to Hypercholesterolemia QTLs using gene ontology and protein interaction networks

Gene identification represents the first step to a better understanding of the physiological role of the underlying protein and disease pathways, which in turn serves as a starting point for developing therapeutic interventions. Familial hypercholesterolemia is a hereditary metabolic disorder characterized by high low-density lipoprotein cholesterol levels. Hypercholesterolemia is a quantitativ...

متن کامل

Molecular Identification of Rare Clinical Mycobacteria by Application of 16S-23S Spacer Region Sequencing

Objective(s) In addition to several molecular methods and in particular 16S rDNA analysis, the application of a more discriminatory genetic marker, i.e., 16S-23S internal transcribed spacer gene sequence has had a great impact on identification and classification of mycobacteria. In the current study we aimed to apply this sequencing power to conclusive identification of some Iranian clinical ...

متن کامل

Identification of specific gene expression after exposure to low dose ionizing radiation revealed through integrative analysis of cDNA microarray data and the interactome

Background: Accumulating reports suggest that the biological effects of low- and high- dose ionizing radiation (LDIR and HDIR) are qualitatively different and might cause different effects in human skin. Materials and Methods: To better understand the potential risks of LDIR, we analyzed three cDNA microarray datasets from the Gene Expression Omnibus database. Results: A pathway analysis showed...

متن کامل

Application of Gene Expression Programming to water dissolved oxygen concentration prediction

This research based on record and collected data from four stations at Eymir Lake, Turkey, which are monitored daily in seven months. Water quality monitoring using former methods are time-needed and expensive, while the application of gene expression programming is more understandable, rapid, and reliable which is used in this article to provide a prediction for dissolved oxygen. The concentra...

متن کامل

Identification of diagnostic biomarkers by bioinformatics analysis in the inflamed and non-inflamed intestinal mucosa in Crohn\'s disease patients

Background: Crohn's disease (CD) is a type of inflammatory bowel disease (IBD) which despite the unknown details is generally related to genetic, immune system, and environmental factors. In this study, we identify transcriptional signatures in patients with CD and then explain the potential molecular mechanisms in inflamed and non-inflamed intestinal mucosa in these patients. Materials and Me...

متن کامل

Molecular phylogeny of some avian species using Cytochrome b gene sequence analysis

Veritable identification and differentiation of avian species is a vital step in conservative, taxonomic, forensic, legal and other ornithological interventions. Therefore, this study involved the application of molecular approach to identify some avian species i.e. Chicken (Gallus gallus), Muskovy duck (Cairina moschata), Japanese quail (Coturnix japonica), Laughing dove (Streptopelia senegale...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Methods in molecular biology

دوره 760  شماره 

صفحات  -

تاریخ انتشار 2011